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631. 


SYNOPSIS OF THE THEORY OF EQUATIONS. 


[From the Messenger of Mathematics, vol. v. (1876), pp. 89—49.] 


THE following was proposed as one of the subjects of a Dissertation for the 
Trinity Fellowships: 

Synopsis of the theory of equations; ie. a statement in a logical order, of the 
divisions of the subject and the leading questions and theorems, but without demonstrations. 

In the subject “Theory of Equations,” the term equation is used to denote an 
equation of the form a” — pa" +...+pn,=0, where p, p.,.., Pn are regarded as known, 
and # as a quantity to be determined ; for shortness, the equation is written f(x) =0. 

The equation may be numerical; that is, the coefficients p,, p.,.., Pn are then 
numbers; understanding by number, a quantity of the form a+ ĝi, where « and 8 have 
any positive or negative real values whatever; or say, each of these is regarded as 
susceptible of continuous variation from an indefinitely large negative to an indefinitely 
large positive value: and 7 denotes /(—1). 

Or the equation may be algebraic; viz. the coefficients are then not restricted to 
denote, or are not explicitly considered as denoting, numbers. 


I. We consider first numerical equations. 

A number a (real or imaginary), such that substituted for æ it makes the function 
a" — pa" +... tpr to be =0, or say, such that it satisfies the equation, is said to 
be a root of the equation; viz. a being a root, we have 

a” —p,a" + ...+ pn =O, or say f(a) =0; | 
and it is then shown that w—a is a factor of the function f(e), viz. that we have 
f (x) = (« — a) fi(æ), where f,(@) is a function 2°" — qa" + ... £ Qn, of the order n—1, 


with numerical coefficients Qu, g2,++» Yn—: 
1—2 
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In general, a is not a root of the equation f\(~7)=0; but it may be so, viz. 
ti(#) may contain the factor æ—a; when this is so, f(#) will contain the factor 
(e—a); writing then f()=(«#-—a)f,(2), and assuming that æ is not a root of the 
equation f,(#)=0, «=a is then said to be a double root of the equation. Similarly, 
J (#) may contain the factor (æx—a)} and no higher power, and then #=a is said to 
be a triple root; and so on. 

Supposing, in general, that f(«)=(x—a)* F(s), where a is a positive integer which 
may be =1, and Fw is of the order n—a, then if b is a root different from a, we 
shall have «—b a factor (in general a simple one, but it may be a multiple one) of 
F(s), and f(a) will in this case become =(x#—a)*(a—b)? B(x), where £ is a positive 
integer which may be =1, and a is of the order n—a—f. The original equation 
fe =0 is in this case said to have a roots each =a, B roots each =b, and so on. 


We have the theorem, a numerical equation of the order n has in every case n 
roots, viz. there exist n numbers a, b,... (in general, all of them distinct, but they 
may arrange themselves in groups of equal values) such that 


J (@) = (#@—a)(@—b)(x—c)... identically. 


If an equation has equal roots, these can in general be determined; the case is at 
any rate a special one, which may be here omitted from consideration. It is there- 
fore, in general, assumed that the equation f(~)=0 under consideration has all its 
roots unequal. If the coefficients p,, p.,... are all or any one or more of them 
imaginary, then the equation f(x)=0, separating the real and imaginary parts, may 
be written F (æ)+i® (<)=0, where F (e), (s) are each of them a function with real 
coefficients; and it thus appears that the equation f(v)=0 with imaginary coefficients 
has not in general any real root; supposing it to have a real root a, this must be 
at once a root of each of the equations F (s)=0 and Ẹ® (æ)= 0. 


But an equation with real coefficients may have as well imaginary as real roots; 
and we have further the theorem that for such an equation the imaginary roots enter 
in pairs, viz. a+ i being a root, then will also a— i be a root. 


Considering an equation with real coefficients, the question arises as to the number 
and situation of its real roots; this is completely resolved by means of Sturm’s 
theorem, viz. we form a series of functions f(z), f’(«), fo(#),.., fr(#) (a constant) of 
the degrees n, n—1,.., 2, 1, O respectively; and substituting therein for # any two 
real values a and b, we find by means of the resulting signs of these functions how 
many real roots of f(#) lie between the limits a, b. 


The same thing can frequently be effected with greater facility by other means, 
but the only general method is the one just referred to. 


In the general case of an equation with imaginary (it may be real) coefficients, 
the like question arises as to the situation of the (real or imaginary) roots, viz. if 
for facility of conception we regard the constituents a, B of a root a+ zi as the 
coordinates of a point im plano, and accordingly represent the root by such point; 
then drawing in the plane any closed curve or “contour,” the question is how many 
roots lie within such contour. 
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This is solved theoretically by means of a theorem of Cauchy’s, viz. writing in 
the original equation «+7y in place of æ, the function f (æ + iy) becomes =P + iQ, 
where P and Q are each of them a rational and integral function (with real coefficients) 
of (x, y) Imagining the point (#, y) to travel along the contour, and considering the 


number of changes of sign from — to + and from + to — of the fraction 5 corre- 


sponding to passages of the fraction through zero (that is, to values for which P 
becomes = 0, disregarding those for which Q becomes =0), the difference of these 
numbers determines the number of roots within the contour. The investigation leads 
to a proof of the before-mentioned theorem, that a numerical equation of the order 
n has precisely n roots. 


But, for the actual determination, it is necessary to consider a rectangular contour, 
and to apply to each of its sides separately a method such as that of Sturm’s 
theorem; and thus the actual determination ultimately depends on a method such as 
that of Sturm’s theorem. 


Recurrmg to the case of an equation with real coefficients, it is important to 
separate the real roots, viz. to determine limits, such that each real root lies alone 
by itself between two limits l and m. This can be done (with more or less difficulty 
according to the nearness of the real roots) by repeated applications of Sturm’s 
theorem, or otherwise. 


The same thing would be useful, and can theoretically be effected, in regard to 
the roots of an equation generally, viz. we may, by lines parallel to the axes of 
æ and y respectively, divide the plane into rectangles such that each (real or imaginary) 
root lies alone by itself in a given rectangle; but the ulterior theory, even as regards 
the imaginary roots of an equation with real coefficients, has not been developed, and 
the remarks which immediately follow have reference only to equations with real 
coefficients, and to the real roots of such equations. 


Supposing the roots separated as above, so that a certain root is known to lie 
alone by itself between two given limits, then it is possible by various processes 
(Horner’s, or Lagrange’s method of continued fractions) to obtain to any degree of 
approximation the numerical value of the real root in question, and thus to obtain 
(approximately as above) the values of the several real roots. 


The real roots can also frequently be obtained, without the necessity of a previous 
separation of the roots, by other processes of approximation—Newton’s, as completed 
by Fourier, or by a method given by Encke—and the problem of their determination 
to any degree of approximation may be regarded as completely solved. But this is 
far from being practically the case even as regards the imaginary roots of such 
equations, or as regards the roots of an equation with imaginary coefficients. 


A class of numerical equations which need to be considered, are the binomial 
equations «"—a=0, where a, =a + ĝi, is a complex number. The foregoing conclusions 
apply, viz. there are always n roots, which it may be shown are all unequal. Supposing 
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one of these is 0, so that 6"=a, then, assuming x=6y, we have y”—1=0, which 
equation (like the more general one a—a=0) has precisely n roots; it is shown 
that these are 1, œ, w%.., œ”, where œw is a complex number a+ % such that 
a?+6?=1, or, what is the same thing, a complex number of the form cos @+7sin 0; 


and it then at once appears that @ may be taken = st . We have thus the 


trigonometrical solution of the equation #*—1=0. We may also obtain a like 
trigonometrical solution of the first-mentioned equation «*—a=0. We are thus led 


1 
to the notion (æa numerical) of the radical a”, regarded as an n-valued function, viz. 
any one of these being denoted by ,/(a), then the series of values is 


v(a), oy (a). © y (a). 
1 
Or we may, if we please, use ¥/(a), instead of a”, as a symbol to denote the n-valued 
function. 


It is not necessary, as regards the equation æ” — 1 = 0, to refer here to the 
distinctions between the cases n a prime, and a composite, number. 


As the coefficients of an algebraical equation may be numerical, all which follows in 
regard to algebraical equations, is (with, it may be, some few modifications) applicable 
to numerical equations; and hence, concluding for the present this subject, it will be 
convenient to pass on to algebraical. equations. 


II. We consider, secondly, an algebraical equation 
a” — pc" + ...=0, 


and we here asswme the existence of roots, viz. we assume that there are n quantities 
a, b, c,... (in general, all of them different, but in particular cases they may become 
equal in sets in any manner), such that 


a” — po" +... = (æ — a) (x — b)... 


Or, looking at the question in a different point of view, and starting with the roots 
a, b, c,... as given, we express the product of the n factors w—a, æ—b,... in the 
foregoing form, and thus arrive at an equation of the order n having the n roots 
a, b, c,..... In either case, we have 


pi =a, p,=ab,.., Pn = abe... 


viz. regarding the coefficients pı, p.,.., Pn as given, then we assume the existence of 
roots a, b, c,... such that p,= a, &., or regarding the roots as given, then we write 
Pi, Po, &e., to denote the functions La, Lab, &e. 


It is to be noticed that, in virtue of 


a” — pa + .,,.=(¢—a)(a—b), &e., 
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or of the equivalent equations p,= a, &c., then 
a” — pa" + ...=0, 
b* = pbP) +... =O, 
&e., 
(viz. it is for this reason that a, 6,... are said to be roots of a” — pa +... =0); 


and, moreover, that conversely from the last-mentioned equations, assuming that a, b,... 
are all different, we deduce 


pi = Xa, pı = ab, &c., 


and 

a" — pa" +... =(#—a)(w@—b).... 
Observe that, if for instance a=b, then the two equations a”—p,a""+...=0, 
b” — p,b”*+...=0 would reduce themselves to a single equation, which would not of 


itself express that æ was a double root, that is, that (w—a)*? was a factor of 
a” — pa" + &e.; but by considering b as the limit of a+h, h indefinitely small, we 
obtain a second equation 

na" — (n — l) pa"? +... = 0, 
which, with the first, expresses that a is a double root; and then the whole system 
of equations leads, as before, to the equations p,= a, &c. But this in passing: the 
general case is when the roots are all unequal. 

We have then the theorem that every rational symmetrical function of the roots 
is a rational function of the coefficients; this is an easy consequence from the less 
general theorem, every rational and integral symmetrical function of the roots is a 
rational and integral function of the coefficients. 

In particular, the sums of powers Ža’, ta’, &., are rational and integral functions 
of the coefficients. 

An ordinary process, as regards the expression of other functions Ya*b®, &c., in 
terms of- the coefficients, is to make them depend on the functions Ya*, &c., but this 
is very objectionable; the true theory consists in showing that we have systems of 
equations 


p = 24, 
P = Lab, 
l p? = Ya? + 22ab, 
ps = Labe, 
DP: = Zab + 3zabe, 
pè = La’ + 320° + babe, 
&e., &e. 


where, in each system, there are precisely as many equations as there are root-functions 
on the right-hand side, eg. 3 equations and 3 functions abc, Zab, Za’. Hence, in 
each system, the root-functions can be determined linearly in terms of the powers and 


products of the coefficients. 
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It follows that it is possible to determine an equation (of an assignable order) 
having for roots any given (unsymmetrical) functions of the roots of a given equation. 
For example, in the case of a quartic equation, roots (a, b, c, d), it is possible to find 
an equation having the roots ab, ac, ad, be, bd, cd, being therefore a sextic equation; 
viz. in the product (y — ab) (y—ac) (y — ad) (y — be) (y — bd) (y — cd), the coefficients of 
the several powers of y will be symmetrical functions of a, b, c, d, and therefore 
rational and integral functions of the coefficients of the original quartic equation. 


In connexion herewith, the question arises as to the number of values (obtained 
by permutations of the roots) of given unsymmetrical functions of the roots; for instance, 
with roots (a, b, c, d) as before, how many values are there of the function ab + cd; 
or, better, how many functions are there of this form; the answer is 3, viz. ab + cd, 
ac+bd, ad+bc; or, again, we may ask whether it is possible to obtain functions of a 
given number of values, 3-valued, 4-valued functions, &c. 


We have, moreover, the very important theorem that, given the value of any 
unsymmetrical function, eg. ab+cd, it is in general possible to determine rationally 
the value of any similar function, eg. (a +b) + (c+ dY. 


The @ priori ground of this theorem may be illustrated by means of a numerical 
equation. Suppose, eg. that the roots of a quartic equation are 1, 2, 3, 4; then if it 
is given that ab+cd=14, this in effect determines a, b to be 1, 2 (viz. a=1, b=2, 
or else a=2, b=1) and ¢,d to be 3, 4 (viz. c=3, d=4, or else c=4, d=8); and 
it therefore in effect determines (a+6)'+(c+d)> to be =370, and not any other 
value. And we can in the same way account for cases of failure as regards particular 
equations; thus, the roots being 1, 2, 3, 4, as above, ab=2 determines a to be =1 
and b to be =2; but if the roots had been 1, 2, 4, 16, then a%=16 does not 
uniquely determine a and b, but only makes them to be 1 and 16, or else 2 and 4, 
respectively. 


As to the à posteriori proof, assume, for instance, t =ab + cd, y,=(a +b + (c +d), 
and so t,=ac+db, y,=(a+c)>+(d+6), &e—in the present case there are only the 
functions t,, te, t and 4%, Ya, Ys—then yı + Yat Ys, Lt bYe + tY ty + try, + tys, will 
be respectively symmetrical functions of the roots of the quartic, and therefore rational 
and integral functions of its coefficients, that is, they will be known. 


Imagine, in the first instance, that 4, t t, are all known; then the equations 
being linear in %1, Yz, Yz, these can be expressed rationally in terms of known functions 
of the coefficients and of t, t» t, that is, %, Yə ys will be known. But observe 
further, that y, is obtained as a function of t, t», t, symmetrical as regards t, t; 
it can consequently be expressed as a rational function of t and of t+, tt, or, 
what is the same thing, of 4 and 4,++4, ht: + bht: + tt, tts; but these last will 
be symmetrical functions of the roots, and as such expressible rationally in terms of 
the coefficients: that is, y, will be expressed as a rational function of ¢, and of the 
coefficients, or, 4, being known, y, will be rationally determined. 


We may consider now the question of the algebraical solution of equations, or, 
more accurately, that of the solution of equations by radicals. 
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In the case of a quadric equation 2*°+pe+q=0, we can find for s, by the 
assistance of the sign ( ) or ( )}, an expression for æ as a two-valued function 
of the coefficients p, q, such that, substituting this value in the equation, the equation 
is thereby identically satisfied, viz. we have 


z=-tptV/(tp -9Q), 
giving 
P= 3p —qt pv(tp—4q) 
+pe=-tp +tprv(tp?-49) 
+q = +q 
æ+pæ+q =0, 


and the equation is on this account said to be algebraically solvable, or, more accurately, 
to be solvable by radicals. Or we may, by writing «=—4p+z, reduce the equation 
to 2=ł}p°— q, viz. to an equation of the form 2 =q, and, in virtue of its being thus 
reducible, we may say that the equation is solvable by radicals. And the question for 
an equation of any higher order is, say of the order n, can we by means of radicals, 
1 

that is, by aid of the sign %/( ) or ( )™, using as many as we please of such 
signs and with any values of m, find an n-valued function (or any function) of the 
coefficients, which substituted for æ in the equation shall satisfy it identically. 


It will be observed that the coefficients p, q,... are not explicitly considered as 
numbers, but that even if they do denote numbers, the question whether a numerical 
equation admits of solution by radicals is wholly unconnected with the before-mentioned 
theorem of the existence of the n roots of such an equation. It does not even follow 
that, in the case of a numerical equation solvable by radicals, the algebraical expression 
of æ gives the numerical solution; but this requires explanation. Consider, first, a 
numerical quadric equation with imaginary coefficients; in the formula æ= — 4p + V(4p— q), 
substituting for p, q their given numerical values we obtain for æ an expression of the 
form æ=a+ Bi +y(y+ õi), where a, B, y, & are real numbers; this value substituted 
in the numerical equation would satisfy it identically and it is thus an algebraical 
solution; but there is no obvious @ priori reason why the expression (y + 6) should 
have a value =c¢ +di, where c and d are real numbers calculable by the extraction 
of a root or roots of real numbers; it appears upon investigation that (y+ èi) has 
such a value calculable by means of the radical expression „y {/(y? + &) + y}; and hence 
that the algebraical solution of a quadric equation does in every case give the 
numerical solution of a numerical quadric. The case of a numerical cubic will be 


considered presently. 


A cubic equation can be solved by radicals, viz. taking for greater simplicity the 
cubic in the reduced form «*-— ge—r=0, and writing a=a+b, this will be a solution 


if only 3ab=q, and a?+b*=r, or say $ (a° +b)=ġr; whence 
£(@- b)=+ V(r — Hg), 
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and therefore 


a=V/ {gr t Var — irt) 


a six-valued function of q, r. But then writing b= st, we have, as may be shown, 


a+b a three-valued function of the coefficients; it would have been wrong to com- 
plete the solution by writing b=,/{4r+/(47°—4¢)}, since here (a+b) would be 
given as a 9-valued function, having only 3 of its values roots, and the other 6 values 
being irrelevant. An interesting variation of the solution is to write æ= ab (a +b), 


giving ab?(a?+b*)=r and 3ab =q, or say 4(a°+b*)=3 : , ab? =41q; whence 


pa- Sa-k) 


and therefore 
a= Wta- b= REFE Nar- oh, 


and here although a, b are each of them a 6-valued function, yet, as may be shown, 
ab (a+b) is only a 3-valued function. 


In the case of a numerical cubic, even when the coefficients are real, substituting 
their values in the expression 


v= (hr V(b — A fr VG- ae) 


this may depend on an expression of the form (y+ ôi), where y and 6 are real 
numbers (viz. it will do so if 47°— 9° is a negative number), and here we cannot 
by the extraction of any root or roots of real numbers reduce \/(y+ ôi) to the form 
c+di, c and d real numbers; hence, here the algebraical solution does not give the 
numerical solution. It is to be added that the case in question, called the “irreducible 
case,” is that wherein the three roots of the cubic equation are all real; if the roots 
are one real and two imaginary, then, contrariwise, the quantity under the cube root is 
real, and the algebraical solution gives the numerical one. 


The irreducible case is solvable by a trigonometrical formula, but this is not a 
solution by radicals; it consists, in effect, in reducing the given numerical cubic (not 
to a cubic of the form z*=a, solvable by the extraction of a cube root, but) to a 
cubic of the form 40*—3z2=a, corresponding to the equation 4cos*@—3 cos 0 = cos 30 
which serves to determine cos @ when cos 30 is known. 


A quartic equation is solvable by radicals; and it may be remarked, that the 
existence of such a solution depends on the existence of 3-valued functions such as 
ab+cd, of the four roots (a, b, c, d); by what precedes, ab+cd is the root of a cubic 
equation, which equation is solvable by radicals; hence ab+cd can be found by radicals; 
and since abed is a given value, ab and cd can each be found by radicals, But by 
what precedes, if ab be known, then any similar function, say a+b, is obtainable 
rationally; and, consequently, from the values of a+b and ab we may by radicals 
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obtain the value of a@ or b, that is, an expression for a root of the given quartic 
expression; the expression finally obtained is 4-valued, corresponding to the different 
values of the several radicals which enter therein, and we have therefore the expression 
by radicals of each of the four roots of the quartic equation. But when the quartic 
is numerical, the same thing arises as in the cubic: the algebraical expression does 
not in every case give the numerical one. 


It will be understood from the foregoing explanation as to the quartic, how in 
the next following case, that of a quintic equation, the question of the solvability by 
radicals depends on the existence or non-existence of k-valued functions of the five 
roots (a, b, c, d, e); a fundamental theorem on the subject is that a rational function 
of 5 letters, if it has less than 5, cannot have more than 2 values; viz. that there 
are no 3-valued, or 4-valued, functions of 5 letters; and by reasoning, depending in 
part upon this theorem, Abel showed that a general quintic equation is not solvable 


by radicals: and @ fortiori the general equation of any order higher than 5 is not 
solvable by radicals. 


The general theory of the solvability of an equation by radicals depends very 
much on Vandermonde’s remark, that supposing an equation is solvable (by radicals) 
and that we have therefore an algebraical expression of æ in terms of the coefficients, 
then substituting for the coefficients their values in terms of the roots, the resulting 
value of the expression must reduce itself to any one at pleasure of the roots a, b, ¢,...; 
thus in the case of the quadric equation where the solution is e=+}p+/(fp?—4q), 
writing for p, q their values a+b, ab, this is «=}[(a+b)+ V{(a—b)}], =a or b 
according to the value of the radical. But it is not considered necessary in the 
present sketch to go further into the theory of the solvability of an equation by 
radicals, It may pe proper to remark that, for quintic equations, there are . solutions 
analogous to the trigonometrical solution of a cubic equation, viz. the quintic equation 
is here in effect reduced to some special form of quintic equation; for instance, to 
Jerrard’s form a°+aex+b=0 or to some form presenting itself in the theory of elliptic 
functions; but the solutions in question are not solutions by radicals. And there are 
various other interesting parts of the theory which have been excluded from consideration. 
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